The Khmer Script Tamed by the Lion (of TeX)

Author

  • Yannis Haralambous
Abstract

This paper presents a Khmer typesetting system based on TeX, METAFONT, and an ANSI C filter. A 128-character of the -/-bit ASCII table for the Khmer script is proposed. Text is input phonically, using the spoken order consonant, subscript consonant, second subscript consonant, vowel, diacritic. The filter converts the phonic description of consonantal clusters into a graphic, TeXnical description of them. Thanks to TeX booleans, independent vowels can be automatically decomposed according to recent reforms of Khmer spelling. The last section presents a forthcoming implementation of Khmer in a 16-bit TeX output font, which solves the kerning problem of consonantal clusters.

Introduction to Khmer Script

The Khmer script is used to write Khmer, the official language of the Cambodian Republic, which belongs to the Mon-Khmer group of Austroasiatic languages. It is a very old and beautiful script and, from the typesetter's point of view, one of the most challenging and exciting scripts in the world. To understand the complications of Khmer typesetting, we will start with a quick overview of the Khmer writing system.

Khmer is written from left to right. The Khmer alphabet has the following 32 consonants:

ក ខ គ ឃ ង ច ឆ ជ ឈ ញ ដ ឋ ឌ ឍ ណ ត ថ ទ ធ ន ប ផ ព ភ ម យ រ ល វ ស ហ ឡ

The character អ denotes the absence of a consonant. From the typesetter's point of view, and with respect to collating order, it might as well be considered a consonant. We will use a box □ to denote an arbitrary consonant. These 33 "consonants" (except ឡ) can appear in the form of subscript consonants:

[subscript consonant glyphs not recoverable from the extracted text]

A subscript consonant is pronounced after the "primary" consonant. Nevertheless, as the reader has certainly noticed, the subscript form of រ is written on the left of the primary consonant. It is also possible to have two subscript consonants carried by the same primary consonant. In that case, the second subscript consonant has to be the subscript រ.

Examples: [example glyphs not recoverable from the extracted text]

A consonant, a consonant + subscript combination, or a consonant + double subscript combination can carry a vowel. There are 28 vowels:

[vowel glyphs not recoverable from the extracted text]

Although vowels are always pronounced after consonants, their graphical representation literally surrounds the consonant/subscript combination: they can appear above, beneath, on the right, or on the left of consonants. Often a vowel's glyph has two or three non-connected parts. When vowels are combined with subscript consonants, the following graphical rules apply:

  • if the subscript has a right protruding stem, then the vowel ា connects to the subscript and not to the consonant [example glyphs not recoverable];
  • if the consonant carries both the subscript រ and a vowel with a left branch, then the latter is placed on the left of the former [example glyphs not recoverable];
  • if the consonant carries both a subscript consonant and a subscript vowel, then the latter is placed underneath the former [example glyphs not recoverable].
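The placement rules described above can be sketched in code. The following is a minimal illustration, not the paper's ANSI C filter: it takes one cluster in phonic order and sorts its parts into visual positions. All names here (`Cluster`, `layout`, `RO`, and the sample vowel sets) are hypothetical labels chosen for this sketch, and multi-part vowels are simplified to a single position.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional

# All names below are hypothetical labels for illustration only;
# they are not the paper's encoding or its filter's identifiers.
LEFT_SUBSCRIPT = "RO"                    # the one subscript drawn left of the primary consonant
LEFT_BRANCH_VOWELS = {"E", "AE", "AI"}   # assumed sample of vowels with a left branch
SUBSCRIPT_VOWELS = {"U", "UU"}           # assumed sample of vowels drawn below the consonant


@dataclass
class Cluster:
    """One cluster in phonic (spoken) order: consonant, subscripts, vowel."""
    consonant: str
    subscripts: List[str] = field(default_factory=list)
    vowel: Optional[str] = None


def layout(cluster: Cluster) -> Dict[str, List[str]]:
    """Sort the parts of a cluster into visual positions.

    'left' is ordered left to right, 'below' is ordered top to bottom.
    """
    if len(cluster.subscripts) > 2:
        raise ValueError("a primary consonant carries at most two subscripts")

    left: List[str] = []
    below: List[str] = []
    other: List[str] = []

    # The left-written subscript goes before the consonant, all others go beneath it.
    for sub in cluster.subscripts:
        (left if sub == LEFT_SUBSCRIPT else below).append("sub:" + sub)

    if cluster.vowel is not None:
        if cluster.vowel in LEFT_BRANCH_VOWELS:
            # A left-branch vowel is placed to the left of the left-written subscript.
            left.insert(0, "vowel:" + cluster.vowel)
        elif cluster.vowel in SUBSCRIPT_VOWELS:
            # A subscript vowel is placed underneath any subscript consonant.
            below.append("vowel:" + cluster.vowel)
        else:
            # Vowels above or to the right are not ordered by this sketch.
            other.append("vowel:" + cluster.vowel)

    return {"left": left, "base": [cluster.consonant], "below": below, "other": other}
```

For instance, `layout(Cluster("KA", ["RO"], "E"))["left"]` yields `["vowel:E", "sub:RO"]`: the parts pronounced last are drawn first, which is why a reordering step between phonic input and typeset output is needed at all.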

Similar resources

Typesetting Khmer

Because of the complexity of Khmer script, up to now there has been neither a typesetting system nor standard encoding for the Khmer language. Presented in this paper are: (a) a complete typesetting system for Khmer based on TeX, METAFONT and an ANSI C preprocessor, as well as (b) a proposal for an 8-bit encoding table for Khmer information interchange. Problems of phonic input, subscript and s...

Comparison of Approaches for Language Revitalization of Northern Khmer in Thailand

Although 1.4 million people speak Northern Khmer in Thailand, they are aware that their language is still in decline. To deal with this threat, native speakers have cooperated with linguists from Mahidol University to work on a community-based research project since 2007. Teaching the Northern Khmer language as a subject in the formal school system was the first project which started at Ban Pho...

The Lion-Bull Motifs of Persepolis: The Zoogeographic Context.

The lion-bull iconography of the Achaemenian period of ancient Persia has generated different theories involving astronomical and seasonal events, besides the suggestion that it could symbolize the time cycle of the day, with the lion representing the sun and the bull the night. However, the present paper draws the reader's attention to the hitherto unexplored angle of zoology to understand physiognom...

The History of Uighur script and calligraphy in Persian manuscripts

After the Mongol invasion of the Iranian plateau, the invaders introduced new cultural elements that influenced some aspects of Persian book art. Uighur script, which was first used to write Mongol and then eastern Turkish languages, appeared in Persian manuscripts produced for Timurid governors, and some famous works remain from Yazd, Herat, Guilan and Shiraz. These...

Investigating the Energy Efficiency of TEX High Energy Derivatives with Different Carbon Fuller Nano Structures under Different Temperature Conditions by DFT Method

In this study, high-energy derivatives of TEX with different carbon-containing fullerenes at different temperature conditions were studied using density functional theory. For this purpose, the materials were first geometrically optimized, and then the thermodynamic parameters were calculated for all of them. Then, the process of changing the energy-dependent parameters such as specific heat cap...


Publication date: 1993